GRAPH: A Domain Ontology-driven Semantic Graph Auto Extraction System

نویسندگان

  • Chunying Zhou
  • Huajun Chen
  • Jinhuo Tao
چکیده

This paper presents sGRAPH – a domain ontology-driven semantic graph auto extraction system used to discover knowledge from text publications in traditional Chinese medicine. The traditional Chinese medicine language system (TCMLs), composed of an ontology schema and a knowledge base containing 153,692 words and 304,114 relations, is used as the domain ontology. The sGRAPH comprises two components: a user interface that interacts with users and the domain ontology-based semantic graph extraction algorithm. This algorithm is divided into five steps: text processing, semantic graph extraction, graph identification, keyword-based semantic graph search and the selectable enrichment to the knowledge base. When the knowledge base of TCMLs is used, the domain-specific words are extracted from sentences more accurately; and the hierarchical structure of the ontology can also be used to help identify the extracted graphs. The algorithm not only can extract relations between words that have already been annotated by relations in the knowledge base but also can predict the relations between words that have never been annotated by relations. The sGRAPH was developed and evaluated by extracting semantic graphs from 2000 publications which predicted 6778 relations that have never been found.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TSINGHUA SCIENCE AND TECHNOLOGY Ontology-driven Mashup Auto-completion on the Data API Network

Abstract: This paper presents an ontology-driven mashup auto-completion system that consists of two technical components: building a data API network and ontology-driven mashup auto-completion on this network. A Microformats-based ontology is firstly defined to describe attributes and activities of data APIs. A semantic subgraph template is proposed to describe all three types of information so...

متن کامل

Centralized Clustering Method To Increase Accuracy In Ontology Matching Systems

Ontology is the main infrastructure of the Semantic Web which provides facilities for integration, searching and sharing of information on the web. Development of ontologies as the basis of semantic web and their heterogeneities have led to the existence of ontology matching. By emerging large-scale ontologies in real domain, the ontology matching systems faced with some problem like memory con...

متن کامل

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Semantic Graph Mining for e-Science

In this paper, we present a methodology, called Semantic Graph Mining, for computer-aided extraction of actionable rules from consolidated semantic graphs of statements. First, generate semantic annotations of a set of heterogeneous knowledge/information resources in terms of domain ontology. Second, merge a semantic graph by means of semantic integration of the annotated resources. Third, disc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011